Warehouse Creation - A Potential Roadblock to Data Warehousing
Abstract
Data warehousing is gaining in popularity as organizations realize the benefits of being able to perform sophisticated analyses of their data. Recent years have seen the introduction of a number of data-warehousing engines, from both established database vendors and new players. The engines themselves are relatively easy to use and come with a good set of end-user tools. However, there is one key stumbling block to the rapid development of data warehouses, namely warehouse population. Specifically, problems arise in populating a warehouse with existing data because that data exhibits various types of heterogeneity. Given the lack of good tools, this task has generally been performed by system integrators, e.g., software consulting organizations that have developed in-house tools and processes for the task. The general conclusion is that the task has proven to be labor-intensive, error-prone, and generally frustrating, causing a number of warehousing projects to be abandoned midway through development. The picture is not as grim as it appears, though. The problems encountered in warehouse creation are very similar to those encountered in data integration, which have been studied for about two decades. Even so, not all problems relevant to warehouse creation have been solved, and a number of research issues remain. The principal goal of this paper is to identify the common issues in data integration and data-warehouse creation. We hope this will lead (1) developers of warehouse creation tools to examine and, where appropriate, incorporate the techniques developed for data integration, and (2) researchers in both the data integration and the data warehousing communities to address the open research issues in this important area.
Related resources
Specifics of Financial Data Warehousing and Implications for Management of Complex ISD Projects
Data warehouses play important roles in the IT landscape of the financial industry. In recent years most of the leading banks have implemented data warehouse solutions to fulfil regulatory or internal requirements. The institutes have to handle situations that specifically concern data warehouse projects in the financial industry. On the basis of three case studies, observing DHW projects for a durati...
The role of boundary objects and boundary spanning in data warehousing - A research-in-progress report
Data warehouse projects bring together different communities of practice, with the primary objective of producing one body of information that can yield comparative advantages in business analysis. Due to the number of involved communities and the complexity of their collaboration, data warehouse projects are costly. In this paper we take a closer look at communication problems on boundari...
Proposed Quality Evaluation Framework to Incorporate Quality Aspects in Web Warehouse Creation
A Web Warehouse is a read-only repository maintained on the web to effectively handle the relevant data. It is a system comprising various subsystems and processes, and it supports organizations in decision making. The quality of the data stored in a web warehouse can affect the quality of the decisions made. For valuable decision making it is required to consider the quality aspects in designing an...
Rethinking Warehouse Design Research
The Keck Virtual Factory Lab (KVFL) was created to be a platform for computationally-based research on industrial logistics systems, and currently engages nine faculty and over a dozen graduate students. Initially, the KVFL has focused on warehousing and the development of integrated computational tools to support warehouse design and optimization, and warehousing-related courses. Recent collab...
A Framework for Information Quality in a Data Warehouse: IQ in the context of Data Marts and Data Warehouses
Data warehousing technology provides integrated data from a multitude of sources that is non-volatile and transformed into meaningful information for decision-making purposes. As organizations embrace data warehousing technology as a means of accessing information, the need for quality information within a data warehouse is imperative to the sustained success and use of this technology. There ha...
Journal: IEEE Trans. Knowl. Data Eng.
Volume: 11
Publication date: 1999